Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 8616 |
| Missing cells | 19500 |
| Missing cells (%) | 11.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.3 MiB |
| Average record size in memory | 160.0 B |
Variable types
| Numeric | 1 |
|---|---|
| Text | 8 |
| Categorical | 10 |
| Unsupported | 1 |
NIVEL has constant value "" | Constant |
Unnamed: 0 is highly overall correlated with DEPARTAMENTO and 2 other fields | High correlation |
DEPARTAMENTO is highly overall correlated with Unnamed: 0 and 2 other fields | High correlation |
JORNADA is highly overall correlated with PLAN | High correlation |
PLAN is highly overall correlated with JORNADA | High correlation |
DEPARTAMENTAL is highly overall correlated with Unnamed: 0 and 2 other fields | High correlation |
ZONA is highly overall correlated with Unnamed: 0 and 2 other fields | High correlation |
SECTOR is highly imbalanced (62.5%) | Imbalance |
AREA is highly imbalanced (56.9%) | Imbalance |
STATUS is highly imbalanced (52.9%) | Imbalance |
MODALIDAD is highly imbalanced (80.8%) | Imbalance |
PLAN is highly imbalanced (55.9%) | Imbalance |
DISTRITO has 206 (2.4%) missing values | Missing |
TELEFONO has 1566 (18.2%) missing values | Missing |
SUPERVISOR has 207 (2.4%) missing values | Missing |
DIRECTOR has 1768 (20.5%) missing values | Missing |
CODIGO has 8616 (100.0%) missing values | Missing |
ZONA has 7080 (82.2%) missing values | Missing |
Unnamed: 0 has unique values | Unique |
CODIGO has unique values | Unique |
CODIGO is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2023-08-11 04:48:19.573003 |
|---|---|
| Analysis finished | 2023-08-11 04:48:24.459804 |
| Duration | 4.89 seconds |
| Software version | ydata-profiling vv4.3.2 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 8616 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4332.8214 |
| Minimum | 0 |
|---|---|
| Maximum | 9022 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 432.75 |
| Q1 | 2157.75 |
| median | 4319.5 |
| Q3 | 6487.25 |
| 95-th percentile | 8222.25 |
| Maximum | 9022 |
| Range | 9022 |
| Interquartile range (IQR) | 4329.5 |
Descriptive statistics
| Standard deviation | 2517.4847 |
|---|---|
| Coefficient of variation (CV) | 0.58102664 |
| Kurtosis | -1.1672341 |
| Mean | 4332.8214 |
| Median Absolute Deviation (MAD) | 2165 |
| Skewness | 0.027487503 |
| Sum | 37331589 |
| Variance | 6337729 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 5760 | 1 | < 0.1% |
| 5774 | 1 | < 0.1% |
| 5773 | 1 | < 0.1% |
| 5772 | 1 | < 0.1% |
| 5771 | 1 | < 0.1% |
| 5770 | 1 | < 0.1% |
| 5769 | 1 | < 0.1% |
| 5768 | 1 | < 0.1% |
| 5767 | 1 | < 0.1% |
| Other values (8606) | 8606 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 9022 | 1 | |
| 9021 | 1 | |
| 9020 | 1 | |
| 9012 | 1 | |
| 9011 | 1 | |
| 9010 | 1 | |
| 9009 | 1 | |
| 9008 | 1 | |
| 9007 | 1 | |
| 9006 | 1 |
CODIGO
Text
UNIQUE 
| Distinct | 8616 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 112008 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8616 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 16-01-0138-46 |
|---|---|
| 2nd row | 16-01-0139-46 |
| 3rd row | 16-01-0140-46 |
| 4th row | 16-01-0141-46 |
| 5th row | 16-01-0142-46 |
| Value | Count | Frequency (%) |
| 16-01-0138-46 | 1 | < 0.1% |
| 16-01-0565-46 | 1 | < 0.1% |
| 16-01-0143-46 | 1 | < 0.1% |
| 16-01-0145-46 | 1 | < 0.1% |
| 16-01-0147-46 | 1 | < 0.1% |
| 16-01-0150-46 | 1 | < 0.1% |
| 16-01-0155-46 | 1 | < 0.1% |
| 16-01-0428-46 | 1 | < 0.1% |
| 16-01-0471-46 | 1 | < 0.1% |
| 16-01-0478-46 | 1 | < 0.1% |
| Other values (8606) | 8606 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 25848 | |
| 0 | 24108 | |
| 1 | 13841 | |
| 4 | 12304 | |
| 6 | 11683 | |
| 2 | 5962 | 5.3% |
| 3 | 4395 | 3.9% |
| 5 | 3816 | 3.4% |
| 8 | 3459 | 3.1% |
| 7 | 3370 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 86160 | |
| Dash Punctuation | 25848 | 23.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24108 | |
| 1 | 13841 | |
| 4 | 12304 | |
| 6 | 11683 | |
| 2 | 5962 | 6.9% |
| 3 | 4395 | 5.1% |
| 5 | 3816 | 4.4% |
| 8 | 3459 | 4.0% |
| 7 | 3370 | 3.9% |
| 9 | 3222 | 3.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25848 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 112008 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 25848 | |
| 0 | 24108 | |
| 1 | 13841 | |
| 4 | 12304 | |
| 6 | 11683 | |
| 2 | 5962 | 5.3% |
| 3 | 4395 | 3.9% |
| 5 | 3816 | 3.4% |
| 8 | 3459 | 3.1% |
| 7 | 3370 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 112008 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 25848 | |
| 0 | 24108 | |
| 1 | 13841 | |
| 4 | 12304 | |
| 6 | 11683 | |
| 2 | 5962 | 5.3% |
| 3 | 4395 | 3.9% |
| 5 | 3816 | 3.4% |
| 8 | 3459 | 3.1% |
| 7 | 3370 | 3.0% |
DISTRITO
Text
MISSING 
| Distinct | 642 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 206 |
| Missing (%) | 2.4% |
| Memory size | 67.4 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.9882283 |
| Min length | 3 |
Characters and Unicode
| Total characters | 50361 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 84 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 16-031 |
|---|---|
| 2nd row | 16-031 |
| 3rd row | 16-031 |
| 4th row | 16-005 |
| 5th row | 16-005 |
| Value | Count | Frequency (%) |
| 01-403 | 242 | 2.9% |
| 11-017 | 175 | 2.1% |
| 05-033 | 159 | 1.9% |
| 01-411 | 150 | 1.8% |
| 18-008 | 128 | 1.5% |
| 01-409 | 102 | 1.2% |
| 05-007 | 100 | 1.2% |
| 18-039 | 98 | 1.2% |
| 13-004 | 92 | 1.1% |
| 03-002 | 91 | 1.1% |
| Other values (632) | 7073 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14552 | |
| 1 | 10175 | |
| - | 8410 | |
| 2 | 3780 | 7.5% |
| 3 | 3249 | 6.5% |
| 4 | 2513 | 5.0% |
| 6 | 1842 | 3.7% |
| 5 | 1621 | 3.2% |
| 9 | 1513 | 3.0% |
| 7 | 1490 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 41951 | |
| Dash Punctuation | 8410 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 14552 | |
| 1 | 10175 | |
| 2 | 3780 | 9.0% |
| 3 | 3249 | 7.7% |
| 4 | 2513 | 6.0% |
| 6 | 1842 | 4.4% |
| 5 | 1621 | 3.9% |
| 9 | 1513 | 3.6% |
| 7 | 1490 | 3.6% |
| 8 | 1216 | 2.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8410 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 50361 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 14552 | |
| 1 | 10175 | |
| - | 8410 | |
| 2 | 3780 | 7.5% |
| 3 | 3249 | 6.5% |
| 4 | 2513 | 5.0% |
| 6 | 1842 | 3.7% |
| 5 | 1621 | 3.2% |
| 9 | 1513 | 3.0% |
| 7 | 1490 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50361 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 14552 | |
| 1 | 10175 | |
| - | 8410 | |
| 2 | 3780 | 7.5% |
| 3 | 3249 | 6.5% |
| 4 | 2513 | 5.0% |
| 6 | 1842 | 3.7% |
| 5 | 1621 | 3.2% |
| 9 | 1513 | 3.0% |
| 7 | 1490 | 3.0% |
DEPARTAMENTO
Categorical
HIGH CORRELATION 
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| GUATEMALA | |
|---|---|
| ESCUINTLA | |
| HUEHUETENANGO | |
| QUETZALTENANGO | |
| PETEN | |
| Other values (15) |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 9.7129759 |
| Min length | 5 |
Characters and Unicode
| Total characters | 83687 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ALTA VERAPAZ |
|---|---|
| 2nd row | ALTA VERAPAZ |
| 3rd row | ALTA VERAPAZ |
| 4th row | ALTA VERAPAZ |
| 5th row | ALTA VERAPAZ |
Common Values
| Value | Count | Frequency (%) |
| GUATEMALA | 2970 | |
| ESCUINTLA | 599 | 7.0% |
| HUEHUETENANGO | 495 | 5.7% |
| QUETZALTENANGO | 476 | 5.5% |
| PETEN | 379 | 4.4% |
| SUCHITEPEQUEZ | 377 | 4.4% |
| IZABAL | 360 | 4.2% |
| CHIMALTENANGO | 349 | 4.1% |
| ALTA VERAPAZ | 348 | 4.0% |
| SAN MARCOS | 332 | 3.9% |
| Other values (10) | 1931 |
Length
| Value | Count | Frequency (%) |
| guatemala | 2970 | |
| escuintla | 599 | 6.3% |
| huehuetenango | 495 | 5.2% |
| quetzaltenango | 476 | 5.0% |
| verapaz | 468 | 4.9% |
| peten | 379 | 4.0% |
| suchitepequez | 377 | 4.0% |
| izabal | 360 | 3.8% |
| chimaltenango | 349 | 3.7% |
| alta | 348 | 3.6% |
| Other values (13) | 2717 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 17565 | |
| E | 10750 | |
| U | 7627 | |
| T | 7590 | |
| L | 6167 | 7.4% |
| G | 4412 | 5.3% |
| N | 4130 | 4.9% |
| M | 3823 | 4.6% |
| I | 2681 | 3.2% |
| C | 2566 | 3.1% |
| Other values (11) | 16376 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 82765 | |
| Space Separator | 922 | 1.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 17565 | |
| E | 10750 | |
| U | 7627 | |
| T | 7590 | |
| L | 6167 | 7.5% |
| G | 4412 | 5.3% |
| N | 4130 | 5.0% |
| M | 3823 | 4.6% |
| I | 2681 | 3.2% |
| C | 2566 | 3.1% |
| Other values (10) | 15454 |
Space Separator
| Value | Count | Frequency (%) |
| 922 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 82765 | |
| Common | 922 | 1.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 17565 | |
| E | 10750 | |
| U | 7627 | |
| T | 7590 | |
| L | 6167 | 7.5% |
| G | 4412 | 5.3% |
| N | 4130 | 5.0% |
| M | 3823 | 4.6% |
| I | 2681 | 3.2% |
| C | 2566 | 3.1% |
| Other values (10) | 15454 |
Common
| Value | Count | Frequency (%) |
| 922 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 83687 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 17565 | |
| E | 10750 | |
| U | 7627 | |
| T | 7590 | |
| L | 6167 | 7.4% |
| G | 4412 | 5.3% |
| N | 4130 | 4.9% |
| M | 3823 | 4.6% |
| I | 2681 | 3.2% |
| C | 2566 | 3.1% |
| Other values (11) | 16376 |
MUNICIPIO
Text
| Distinct | 299 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
Length
| Max length | 28 |
|---|---|
| Median length | 24 |
| Mean length | 12.228877 |
| Min length | 4 |
Characters and Unicode
| Total characters | 105364 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | COBAN |
|---|---|
| 2nd row | COBAN |
| 3rd row | COBAN |
| 4th row | COBAN |
| 5th row | COBAN |
| Value | Count | Frequency (%) |
| ciudad | 1565 | 10.4% |
| capital | 1536 | 10.2% |
| san | 1357 | 9.0% |
| villa | 455 | 3.0% |
| mixco | 420 | 2.8% |
| nueva | 400 | 2.7% |
| santa | 354 | 2.4% |
| la | 265 | 1.8% |
| quetzaltenango | 241 | 1.6% |
| miguel | 193 | 1.3% |
| Other values (305) | 8227 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 19529 | |
| I | 7623 | 7.2% |
| C | 7333 | 7.0% |
| N | 7066 | 6.7% |
| T | 6947 | 6.6% |
| L | 6788 | 6.4% |
| E | 6492 | 6.2% |
| U | 6488 | 6.2% |
| 6397 | 6.1% | |
| O | 4743 | 4.5% |
| Other values (15) | 25958 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 98967 | |
| Space Separator | 6397 | 6.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 19529 | |
| I | 7623 | 7.7% |
| C | 7333 | 7.4% |
| N | 7066 | 7.1% |
| T | 6947 | 7.0% |
| L | 6788 | 6.9% |
| E | 6492 | 6.6% |
| U | 6488 | 6.6% |
| O | 4743 | 4.8% |
| S | 4587 | 4.6% |
| Other values (14) | 21371 |
Space Separator
| Value | Count | Frequency (%) |
| 6397 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 98967 | |
| Common | 6397 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 19529 | |
| I | 7623 | 7.7% |
| C | 7333 | 7.4% |
| N | 7066 | 7.1% |
| T | 6947 | 7.0% |
| L | 6788 | 6.9% |
| E | 6492 | 6.6% |
| U | 6488 | 6.6% |
| O | 4743 | 4.8% |
| S | 4587 | 4.6% |
| Other values (14) | 21371 |
Common
| Value | Count | Frequency (%) |
| 6397 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 105364 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 19529 | |
| I | 7623 | 7.2% |
| C | 7333 | 7.0% |
| N | 7066 | 6.7% |
| T | 6947 | 6.6% |
| L | 6788 | 6.4% |
| E | 6492 | 6.2% |
| U | 6488 | 6.2% |
| 6397 | 6.1% | |
| O | 4743 | 4.5% |
| Other values (15) | 25958 |
ESTABLECIMIENTO
Text
| Distinct | 4609 |
|---|---|
| Distinct (%) | 53.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
Length
| Max length | 125 |
|---|---|
| Median length | 103 |
| Mean length | 40.211931 |
| Min length | 3 |
Characters and Unicode
| Total characters | 346466 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2963 ? |
|---|---|
| Unique (%) | 34.4% |
Sample
| 1st row | COLEGIO COBAN |
|---|---|
| 2nd row | COLEGIO PARTICULAR MIXTO VERAPAZ |
| 3rd row | COLEGIO "LA INMACULADA" |
| 4th row | ESCUELA NACIONAL DE CIENCIAS COMERCIALES |
| 5th row | INSTITUTO NORMAL MIXTO DEL NORTE 'EMILIO ROSALES PONCE' |
| Value | Count | Frequency (%) |
| de | 3500 | 7.7% |
| colegio | 3326 | 7.3% |
| mixto | 2619 | 5.8% |
| instituto | 2437 | 5.4% |
| liceo | 1610 | 3.5% |
| educacion | 1364 | 3.0% |
| privado | 1333 | 2.9% |
| centro | 1127 | 2.5% |
| diversificada | 772 | 1.7% |
| educativo | 742 | 1.6% |
| Other values (2973) | 26568 |
Most occurring characters
| Value | Count | Frequency (%) |
| 36806 | ||
| I | 36497 | |
| O | 34258 | |
| E | 30600 | 8.8% |
| A | 29397 | 8.5% |
| C | 24226 | 7.0% |
| T | 21605 | 6.2% |
| N | 19971 | 5.8% |
| L | 16256 | 4.7% |
| R | 15742 | 4.5% |
| Other values (39) | 81108 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 303421 | |
| Space Separator | 36806 | 10.6% |
| Other Punctuation | 4714 | 1.4% |
| Dash Punctuation | 769 | 0.2% |
| Decimal Number | 356 | 0.1% |
| Open Punctuation | 199 | 0.1% |
| Close Punctuation | 198 | 0.1% |
| Modifier Symbol | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 36497 | |
| O | 34258 | |
| E | 30600 | |
| A | 29397 | |
| C | 24226 | 8.0% |
| T | 21605 | 7.1% |
| N | 19971 | 6.6% |
| L | 16256 | 5.4% |
| R | 15742 | 5.2% |
| D | 13007 | 4.3% |
| Other values (16) | 61862 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 125 | |
| 0 | 71 | |
| 1 | 57 | |
| 3 | 34 | 9.6% |
| 4 | 20 | 5.6% |
| 7 | 17 | 4.8% |
| 6 | 11 | 3.1% |
| 9 | 7 | 2.0% |
| 8 | 7 | 2.0% |
| 5 | 7 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 2913 | |
| ' | 893 | 18.9% |
| . | 783 | 16.6% |
| , | 106 | 2.2% |
| & | 9 | 0.2% |
| / | 7 | 0.1% |
| % | 2 | < 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 36806 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 769 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 199 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 198 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 303421 | |
| Common | 43045 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 36497 | |
| O | 34258 | |
| E | 30600 | |
| A | 29397 | |
| C | 24226 | 8.0% |
| T | 21605 | 7.1% |
| N | 19971 | 6.6% |
| L | 16256 | 5.4% |
| R | 15742 | 5.2% |
| D | 13007 | 4.3% |
| Other values (16) | 61862 |
Common
| Value | Count | Frequency (%) |
| 36806 | ||
| " | 2913 | 6.8% |
| ' | 893 | 2.1% |
| . | 783 | 1.8% |
| - | 769 | 1.8% |
| ( | 199 | 0.5% |
| ) | 198 | 0.5% |
| 2 | 125 | 0.3% |
| , | 106 | 0.2% |
| 0 | 71 | 0.2% |
| Other values (13) | 182 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 346466 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 36806 | ||
| I | 36497 | |
| O | 34258 | |
| E | 30600 | 8.8% |
| A | 29397 | 8.5% |
| C | 24226 | 7.0% |
| T | 21605 | 6.2% |
| N | 19971 | 5.8% |
| L | 16256 | 4.7% |
| R | 15742 | 4.5% |
| Other values (39) | 81108 |
DIRECCION
Text
| Distinct | 5522 |
|---|---|
| Distinct (%) | 64.5% |
| Missing | 57 |
| Missing (%) | 0.7% |
| Memory size | 67.4 KiB |
Length
| Max length | 110 |
|---|---|
| Median length | 91 |
| Mean length | 28.49188 |
| Min length | 4 |
Characters and Unicode
| Total characters | 243862 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4095 ? |
|---|---|
| Unique (%) | 47.8% |
Sample
| 1st row | KM.2 SALIDA A SAN JUAN CHAMELCO ZONA 8 |
|---|---|
| 2nd row | KM 209.5 ENTRADA A LA CIUDAD |
| 3rd row | 7A. AVENIDA 11-109 ZONA 6 |
| 4th row | 2A CALLE 11-10 ZONA 2 |
| 5th row | 3A AVE 6-23 ZONA 11 |
| Value | Count | Frequency (%) |
| zona | 3808 | 8.3% |
| calle | 2729 | 6.0% |
| avenida | 2085 | 4.6% |
| 1 | 1686 | 3.7% |
| barrio | 1052 | 2.3% |
| colonia | 1029 | 2.2% |
| aldea | 988 | 2.2% |
| el | 877 | 1.9% |
| san | 820 | 1.8% |
| 2 | 610 | 1.3% |
| Other values (3473) | 30096 |
Most occurring characters
| Value | Count | Frequency (%) |
| 37221 | ||
| A | 35423 | |
| E | 15615 | 6.4% |
| L | 15241 | 6.2% |
| N | 14573 | 6.0% |
| O | 14533 | 6.0% |
| I | 11137 | 4.6% |
| C | 9645 | 4.0% |
| R | 8804 | 3.6% |
| 1 | 6600 | 2.7% |
| Other values (39) | 75070 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 165000 | |
| Space Separator | 37221 | 15.3% |
| Decimal Number | 28893 | 11.8% |
| Other Punctuation | 7874 | 3.2% |
| Dash Punctuation | 4820 | 2.0% |
| Lowercase Letter | 22 | < 0.1% |
| Open Punctuation | 16 | < 0.1% |
| Close Punctuation | 16 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 35423 | |
| E | 15615 | |
| L | 15241 | |
| N | 14573 | |
| O | 14533 | |
| I | 11137 | 6.7% |
| C | 9645 | 5.8% |
| R | 8804 | 5.3% |
| D | 5942 | 3.6% |
| T | 5398 | 3.3% |
| Other values (16) | 28689 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6600 | |
| 2 | 4025 | |
| 3 | 3470 | |
| 4 | 2945 | |
| 5 | 2718 | |
| 0 | 2365 | 8.2% |
| 6 | 2078 | 7.2% |
| 7 | 1770 | 6.1% |
| 8 | 1463 | 5.1% |
| 9 | 1459 | 5.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4799 | |
| , | 2458 | |
| " | 469 | 6.0% |
| ' | 92 | 1.2% |
| / | 36 | 0.5% |
| # | 19 | 0.2% |
| ; | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 20 | |
| o | 2 | 9.1% |
Space Separator
| Value | Count | Frequency (%) |
| 37221 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4820 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 165022 | |
| Common | 78840 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 35423 | |
| E | 15615 | |
| L | 15241 | |
| N | 14573 | |
| O | 14533 | |
| I | 11137 | 6.7% |
| C | 9645 | 5.8% |
| R | 8804 | 5.3% |
| D | 5942 | 3.6% |
| T | 5398 | 3.3% |
| Other values (18) | 28711 |
Common
| Value | Count | Frequency (%) |
| 37221 | ||
| 1 | 6600 | 8.4% |
| - | 4820 | 6.1% |
| . | 4799 | 6.1% |
| 2 | 4025 | 5.1% |
| 3 | 3470 | 4.4% |
| 4 | 2945 | 3.7% |
| 5 | 2718 | 3.4% |
| , | 2458 | 3.1% |
| 0 | 2365 | 3.0% |
| Other values (11) | 7419 | 9.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 243862 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 37221 | ||
| A | 35423 | |
| E | 15615 | 6.4% |
| L | 15241 | 6.2% |
| N | 14573 | 6.0% |
| O | 14533 | 6.0% |
| I | 11137 | 4.6% |
| C | 9645 | 4.0% |
| R | 8804 | 3.6% |
| 1 | 6600 | 2.7% |
| Other values (39) | 75070 |
TELEFONO
Text
MISSING 
| Distinct | 4322 |
|---|---|
| Distinct (%) | 61.3% |
| Missing | 1566 |
| Missing (%) | 18.2% |
| Memory size | 67.4 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 56400 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2927 ? |
|---|---|
| Unique (%) | 41.5% |
Sample
| 1st row | 77945104 |
|---|---|
| 2nd row | 77367402 |
| 3rd row | 78232301 |
| 4th row | 79514215 |
| 5th row | 79521468 |
| Value | Count | Frequency (%) |
| 22067425 | 21 | 0.3% |
| 79480009 | 14 | 0.2% |
| 22093200 | 12 | 0.2% |
| 45353648 | 11 | 0.2% |
| 77746400 | 11 | 0.2% |
| 59304894 | 11 | 0.2% |
| 78899679 | 10 | 0.1% |
| 22322912 | 10 | 0.1% |
| 24637777 | 10 | 0.1% |
| 78394519 | 9 | 0.1% |
| Other values (4314) | 6934 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 7296 | |
| 7 | 6780 | |
| 4 | 6339 | |
| 5 | 5957 | |
| 3 | 5759 | |
| 0 | 5087 | |
| 8 | 5045 | |
| 6 | 4981 | |
| 1 | 4583 | |
| 9 | 4535 | |
| Other values (6) | 38 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 56362 | |
| Dash Punctuation | 20 | < 0.1% |
| Other Punctuation | 9 | < 0.1% |
| Space Separator | 7 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 7296 | |
| 7 | 6780 | |
| 4 | 6339 | |
| 5 | 5957 | |
| 3 | 5759 | |
| 0 | 5087 | |
| 8 | 5045 | |
| 6 | 4981 | |
| 1 | 4583 | |
| 9 | 4535 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8 | |
| / | 1 | 11.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 1 | |
| E | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 56398 | |
| Latin | 2 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 7296 | |
| 7 | 6780 | |
| 4 | 6339 | |
| 5 | 5957 | |
| 3 | 5759 | |
| 0 | 5087 | |
| 8 | 5045 | |
| 6 | 4981 | |
| 1 | 4583 | |
| 9 | 4535 | |
| Other values (4) | 36 | 0.1% |
Latin
| Value | Count | Frequency (%) |
| Y | 1 | |
| E | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 7296 | |
| 7 | 6780 | |
| 4 | 6339 | |
| 5 | 5957 | |
| 3 | 5759 | |
| 0 | 5087 | |
| 8 | 5045 | |
| 6 | 4981 | |
| 1 | 4583 | |
| 9 | 4535 | |
| Other values (6) | 38 | 0.1% |
SUPERVISOR
Text
MISSING 
| Distinct | 608 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 207 |
| Missing (%) | 2.4% |
| Memory size | 67.4 KiB |
Length
| Max length | 63 |
|---|---|
| Median length | 44 |
| Mean length | 29.097039 |
| Min length | 14 |
Characters and Unicode
| Total characters | 244677 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 70 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | MERCEDES JOSEFINA TORRES GALVEZ |
|---|---|
| 2nd row | MERCEDES JOSEFINA TORRES GALVEZ |
| 3rd row | MERCEDES JOSEFINA TORRES GALVEZ |
| 4th row | RUDY ADOLFO TOT OCH |
| 5th row | RUDY ADOLFO TOT OCH |
| Value | Count | Frequency (%) |
| de | 1959 | 5.4% |
| lopez | 592 | 1.6% |
| martinez | 572 | 1.6% |
| leon | 543 | 1.5% |
| gonzalez | 488 | 1.4% |
| juan | 457 | 1.3% |
| carlos | 396 | 1.1% |
| morales | 389 | 1.1% |
| hernandez | 356 | 1.0% |
| humberto | 327 | 0.9% |
| Other values (1094) | 30050 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 30691 | |
| 27720 | ||
| E | 23149 | 9.5% |
| R | 19345 | 7.9% |
| O | 18763 | 7.7% |
| I | 16333 | 6.7% |
| L | 15441 | 6.3% |
| N | 14367 | 5.9% |
| S | 10000 | 4.1% |
| D | 8202 | 3.4% |
| Other values (19) | 60666 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 216827 | |
| Space Separator | 27720 | 11.3% |
| Dash Punctuation | 124 | 0.1% |
| Other Punctuation | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 30691 | |
| E | 23149 | |
| R | 19345 | 8.9% |
| O | 18763 | 8.7% |
| I | 16333 | 7.5% |
| L | 15441 | 7.1% |
| N | 14367 | 6.6% |
| S | 10000 | 4.6% |
| D | 8202 | 3.8% |
| C | 8033 | 3.7% |
| Other values (16) | 52503 |
Space Separator
| Value | Count | Frequency (%) |
| 27720 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 124 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 216827 | |
| Common | 27850 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 30691 | |
| E | 23149 | |
| R | 19345 | 8.9% |
| O | 18763 | 8.7% |
| I | 16333 | 7.5% |
| L | 15441 | 7.1% |
| N | 14367 | 6.6% |
| S | 10000 | 4.6% |
| D | 8202 | 3.8% |
| C | 8033 | 3.7% |
| Other values (16) | 52503 |
Common
| Value | Count | Frequency (%) |
| 27720 | ||
| - | 124 | 0.4% |
| . | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 244677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 30691 | |
| 27720 | ||
| E | 23149 | 9.5% |
| R | 19345 | 7.9% |
| O | 18763 | 7.7% |
| I | 16333 | 6.7% |
| L | 15441 | 6.3% |
| N | 14367 | 5.9% |
| S | 10000 | 4.1% |
| D | 8202 | 3.4% |
| Other values (19) | 60666 |
DIRECTOR
Text
MISSING 
| Distinct | 4272 |
|---|---|
| Distinct (%) | 62.4% |
| Missing | 1768 |
| Missing (%) | 20.5% |
| Memory size | 67.4 KiB |
Length
| Max length | 57 |
|---|---|
| Median length | 48 |
| Mean length | 28.673481 |
| Min length | 1 |
Characters and Unicode
| Total characters | 196356 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2914 ? |
|---|---|
| Unique (%) | 42.6% |
Sample
| 1st row | JULIO CESAR VILLELA AMADO |
|---|---|
| 2nd row | VIRGINA SOLANO SERRANO |
| 3rd row | HECOTR WALDEMAR TOT COY |
| 4th row | LUIS FERNANDO SOTO |
| 5th row | MERCEDES QUIROS QUIROS |
| Value | Count | Frequency (%) |
| de | 1329 | 4.6% |
| lopez | 557 | 1.9% |
| garcia | 343 | 1.2% |
| maria | 332 | 1.1% |
| hernandez | 321 | 1.1% |
| morales | 284 | 1.0% |
| perez | 269 | 0.9% |
| gonzalez | 229 | 0.8% |
| jose | 203 | 0.7% |
| ramirez | 199 | 0.7% |
| Other values (3597) | 24838 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 25943 | |
| 22056 | ||
| E | 18879 | 9.6% |
| R | 15587 | 7.9% |
| O | 14215 | 7.2% |
| I | 13285 | 6.8% |
| L | 12166 | 6.2% |
| N | 11437 | 5.8% |
| S | 8040 | 4.1% |
| D | 7237 | 3.7% |
| Other values (24) | 47511 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 174149 | |
| Space Separator | 22056 | 11.2% |
| Other Punctuation | 77 | < 0.1% |
| Dash Punctuation | 71 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 25943 | |
| E | 18879 | |
| R | 15587 | 9.0% |
| O | 14215 | 8.2% |
| I | 13285 | 7.6% |
| L | 12166 | 7.0% |
| N | 11437 | 6.6% |
| S | 8040 | 4.6% |
| D | 7237 | 4.2% |
| C | 6214 | 3.6% |
| Other values (16) | 41146 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 70 | |
| , | 5 | 6.5% |
| " | 2 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 22056 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 71 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 174149 | |
| Common | 22207 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 25943 | |
| E | 18879 | |
| R | 15587 | 9.0% |
| O | 14215 | 8.2% |
| I | 13285 | 7.6% |
| L | 12166 | 7.0% |
| N | 11437 | 6.6% |
| S | 8040 | 4.6% |
| D | 7237 | 4.2% |
| C | 6214 | 3.6% |
| Other values (16) | 41146 |
Common
| Value | Count | Frequency (%) |
| 22056 | ||
| - | 71 | 0.3% |
| . | 70 | 0.3% |
| , | 5 | < 0.1% |
| " | 2 | < 0.1% |
| + | 1 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 196356 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 25943 | |
| 22056 | ||
| E | 18879 | 9.6% |
| R | 15587 | 7.9% |
| O | 14215 | 7.2% |
| I | 13285 | 6.8% |
| L | 12166 | 6.2% |
| N | 11437 | 5.8% |
| S | 8040 | 4.1% |
| D | 7237 | 3.7% |
| Other values (24) | 47511 |
NIVEL
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| DIVERSIFICADO |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 112008 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DIVERSIFICADO |
|---|---|
| 2nd row | DIVERSIFICADO |
| 3rd row | DIVERSIFICADO |
| 4th row | DIVERSIFICADO |
| 5th row | DIVERSIFICADO |
Common Values
| Value | Count | Frequency (%) |
| DIVERSIFICADO | 8616 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| diversificado | 8616 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 25848 | |
| D | 17232 | |
| V | 8616 | 7.7% |
| E | 8616 | 7.7% |
| R | 8616 | 7.7% |
| S | 8616 | 7.7% |
| F | 8616 | 7.7% |
| C | 8616 | 7.7% |
| A | 8616 | 7.7% |
| O | 8616 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 112008 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 25848 | |
| D | 17232 | |
| V | 8616 | 7.7% |
| E | 8616 | 7.7% |
| R | 8616 | 7.7% |
| S | 8616 | 7.7% |
| F | 8616 | 7.7% |
| C | 8616 | 7.7% |
| A | 8616 | 7.7% |
| O | 8616 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 112008 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 25848 | |
| D | 17232 | |
| V | 8616 | 7.7% |
| E | 8616 | 7.7% |
| R | 8616 | 7.7% |
| S | 8616 | 7.7% |
| F | 8616 | 7.7% |
| C | 8616 | 7.7% |
| A | 8616 | 7.7% |
| O | 8616 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 112008 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 25848 | |
| D | 17232 | |
| V | 8616 | 7.7% |
| E | 8616 | 7.7% |
| R | 8616 | 7.7% |
| S | 8616 | 7.7% |
| F | 8616 | 7.7% |
| C | 8616 | 7.7% |
| A | 8616 | 7.7% |
| O | 8616 | 7.7% |
SECTOR
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| PRIVADO | |
|---|---|
| OFICIAL | |
| COOPERATIVA | 200 |
| MUNICIPAL | 134 |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.1239554 |
| Min length | 7 |
Characters and Unicode
| Total characters | 61380 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRIVADO |
|---|---|
| 2nd row | PRIVADO |
| 3rd row | PRIVADO |
| 4th row | OFICIAL |
| 5th row | OFICIAL |
Common Values
| Value | Count | Frequency (%) |
| PRIVADO | 7383 | |
| OFICIAL | 899 | 10.4% |
| COOPERATIVA | 200 | 2.3% |
| MUNICIPAL | 134 | 1.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| privado | 7383 | |
| oficial | 899 | 10.4% |
| cooperativa | 200 | 2.3% |
| municipal | 134 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 9649 | |
| A | 8816 | |
| O | 8682 | |
| P | 7717 | |
| R | 7583 | |
| V | 7583 | |
| D | 7383 | |
| C | 1233 | 2.0% |
| L | 1033 | 1.7% |
| F | 899 | 1.5% |
| Other values (5) | 802 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 61380 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 9649 | |
| A | 8816 | |
| O | 8682 | |
| P | 7717 | |
| R | 7583 | |
| V | 7583 | |
| D | 7383 | |
| C | 1233 | 2.0% |
| L | 1033 | 1.7% |
| F | 899 | 1.5% |
| Other values (5) | 802 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61380 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 9649 | |
| A | 8816 | |
| O | 8682 | |
| P | 7717 | |
| R | 7583 | |
| V | 7583 | |
| D | 7383 | |
| C | 1233 | 2.0% |
| L | 1033 | 1.7% |
| F | 899 | 1.5% |
| Other values (5) | 802 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61380 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 9649 | |
| A | 8816 | |
| O | 8682 | |
| P | 7717 | |
| R | 7583 | |
| V | 7583 | |
| D | 7383 | |
| C | 1233 | 2.0% |
| L | 1033 | 1.7% |
| F | 899 | 1.5% |
| Other values (5) | 802 | 1.3% |
AREA
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| URBANA | |
|---|---|
| RURAL | |
| SIN ESPECIFICAR | 1 |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 5.8201021 |
| Min length | 5 |
Characters and Unicode
| Total characters | 50146 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | URBANA |
|---|---|
| 2nd row | URBANA |
| 3rd row | URBANA |
| 4th row | URBANA |
| 5th row | URBANA |
Common Values
| Value | Count | Frequency (%) |
| URBANA | 7056 | |
| RURAL | 1559 | 18.1% |
| SIN ESPECIFICAR | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| urbana | 7056 | |
| rural | 1559 | 18.1% |
| sin | 1 | < 0.1% |
| especificar | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 15672 | |
| R | 10175 | |
| U | 8615 | |
| N | 7057 | |
| B | 7056 | |
| L | 1559 | 3.1% |
| I | 3 | < 0.1% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| C | 2 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 50145 | |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15672 | |
| R | 10175 | |
| U | 8615 | |
| N | 7057 | |
| B | 7056 | |
| L | 1559 | 3.1% |
| I | 3 | < 0.1% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| C | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50145 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 15672 | |
| R | 10175 | |
| U | 8615 | |
| N | 7057 | |
| B | 7056 | |
| L | 1559 | 3.1% |
| I | 3 | < 0.1% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| C | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50146 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 15672 | |
| R | 10175 | |
| U | 8615 | |
| N | 7057 | |
| B | 7056 | |
| L | 1559 | 3.1% |
| I | 3 | < 0.1% |
| S | 2 | < 0.1% |
| E | 2 | < 0.1% |
| C | 2 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
STATUS
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| ABIERTA | |
|---|---|
| CERRADA TEMPORALMENTE | |
| TEMPORAL TITULOS | 116 |
| TEMPORAL NOMBRAMIENTO | 3 |
Length
| Max length | 21 |
|---|---|
| Median length | 7 |
| Mean length | 10.892526 |
| Min length | 7 |
Characters and Unicode
| Total characters | 93850 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ABIERTA |
|---|---|
| 2nd row | ABIERTA |
| 3rd row | ABIERTA |
| 4th row | ABIERTA |
| 5th row | ABIERTA |
Common Values
| Value | Count | Frequency (%) |
| ABIERTA | 6179 | |
| CERRADA TEMPORALMENTE | 2318 | 26.9% |
| TEMPORAL TITULOS | 116 | 1.3% |
| TEMPORAL NOMBRAMIENTO | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| abierta | 6179 | |
| cerrada | 2318 | 21.0% |
| temporalmente | 2318 | 21.0% |
| temporal | 119 | 1.1% |
| titulos | 116 | 1.0% |
| nombramiento | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 19434 | |
| E | 15573 | |
| R | 13255 | |
| T | 11169 | |
| I | 6298 | 6.7% |
| B | 6182 | 6.6% |
| M | 4761 | 5.1% |
| O | 2559 | 2.7% |
| L | 2553 | 2.7% |
| 2437 | 2.6% | |
| Other values (6) | 9629 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 91413 | |
| Space Separator | 2437 | 2.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 19434 | |
| E | 15573 | |
| R | 13255 | |
| T | 11169 | |
| I | 6298 | 6.9% |
| B | 6182 | 6.8% |
| M | 4761 | 5.2% |
| O | 2559 | 2.8% |
| L | 2553 | 2.8% |
| P | 2437 | 2.7% |
| Other values (5) | 7192 | 7.9% |
Space Separator
| Value | Count | Frequency (%) |
| 2437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 91413 | |
| Common | 2437 | 2.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 19434 | |
| E | 15573 | |
| R | 13255 | |
| T | 11169 | |
| I | 6298 | 6.9% |
| B | 6182 | 6.8% |
| M | 4761 | 5.2% |
| O | 2559 | 2.8% |
| L | 2553 | 2.8% |
| P | 2437 | 2.7% |
| Other values (5) | 7192 | 7.9% |
Common
| Value | Count | Frequency (%) |
| 2437 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 93850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 19434 | |
| E | 15573 | |
| R | 13255 | |
| T | 11169 | |
| I | 6298 | 6.7% |
| B | 6182 | 6.6% |
| M | 4761 | 5.1% |
| O | 2559 | 2.7% |
| L | 2553 | 2.7% |
| 2437 | 2.6% | |
| Other values (6) | 9629 |
MODALIDAD
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| MONOLINGUE | |
|---|---|
| BILINGUE | 254 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9410399 |
| Min length | 8 |
Characters and Unicode
| Total characters | 85652 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MONOLINGUE |
|---|---|
| 2nd row | MONOLINGUE |
| 3rd row | MONOLINGUE |
| 4th row | MONOLINGUE |
| 5th row | BILINGUE |
Common Values
| Value | Count | Frequency (%) |
| MONOLINGUE | 8362 | |
| BILINGUE | 254 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| monolingue | 8362 | |
| bilingue | 254 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 16978 | |
| O | 16724 | |
| I | 8870 | |
| L | 8616 | |
| G | 8616 | |
| U | 8616 | |
| E | 8616 | |
| M | 8362 | |
| B | 254 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 85652 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 16978 | |
| O | 16724 | |
| I | 8870 | |
| L | 8616 | |
| G | 8616 | |
| U | 8616 | |
| E | 8616 | |
| M | 8362 | |
| B | 254 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 85652 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 16978 | |
| O | 16724 | |
| I | 8870 | |
| L | 8616 | |
| G | 8616 | |
| U | 8616 | |
| E | 8616 | |
| M | 8362 | |
| B | 254 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85652 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 16978 | |
| O | 16724 | |
| I | 8870 | |
| L | 8616 | |
| G | 8616 | |
| U | 8616 | |
| E | 8616 | |
| M | 8362 | |
| B | 254 | 0.3% |
JORNADA
Categorical
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| DOBLE | |
|---|---|
| VESPERTINA | |
| MATUTINA | |
| SIN JORNADA | |
| NOCTURNA | 279 |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 7.8535283 |
| Min length | 5 |
Characters and Unicode
| Total characters | 67666 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MATUTINA |
|---|---|
| 2nd row | MATUTINA |
| 3rd row | MATUTINA |
| 4th row | MATUTINA |
| 5th row | VESPERTINA |
Common Values
| Value | Count | Frequency (%) |
| DOBLE | 2866 | |
| VESPERTINA | 2328 | |
| MATUTINA | 2221 | |
| SIN JORNADA | 836 | 9.7% |
| NOCTURNA | 279 | 3.2% |
| INTERMEDIA | 86 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| doble | 2866 | |
| vespertina | 2328 | |
| matutina | 2221 | |
| sin | 836 | 8.8% |
| jornada | 836 | 8.8% |
| nocturna | 279 | 3.0% |
| intermedia | 86 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 8807 | |
| E | 7694 | |
| T | 7135 | |
| N | 6865 | |
| I | 5557 | 8.2% |
| O | 3981 | 5.9% |
| D | 3788 | 5.6% |
| R | 3529 | 5.2% |
| S | 3164 | 4.7% |
| L | 2866 | 4.2% |
| Other values (8) | 14280 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 66830 | |
| Space Separator | 836 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 8807 | |
| E | 7694 | |
| T | 7135 | |
| N | 6865 | |
| I | 5557 | |
| O | 3981 | 6.0% |
| D | 3788 | 5.7% |
| R | 3529 | 5.3% |
| S | 3164 | 4.7% |
| L | 2866 | 4.3% |
| Other values (7) | 13444 |
Space Separator
| Value | Count | Frequency (%) |
| 836 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 66830 | |
| Common | 836 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 8807 | |
| E | 7694 | |
| T | 7135 | |
| N | 6865 | |
| I | 5557 | |
| O | 3981 | 6.0% |
| D | 3788 | 5.7% |
| R | 3529 | 5.3% |
| S | 3164 | 4.7% |
| L | 2866 | 4.3% |
| Other values (7) | 13444 |
Common
| Value | Count | Frequency (%) |
| 836 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67666 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 8807 | |
| E | 7694 | |
| T | 7135 | |
| N | 6865 | |
| I | 5557 | 8.2% |
| O | 3981 | 5.9% |
| D | 3788 | 5.6% |
| R | 3529 | 5.2% |
| S | 3164 | 4.7% |
| L | 2866 | 4.2% |
| Other values (8) | 14280 |
PLAN
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| DIARIO(REGULAR) | |
|---|---|
| FIN DE SEMANA | |
| SEMIPRESENCIAL (FIN DE SEMANA) | 424 |
| SEMIPRESENCIAL (UN DIA A LA SEMANA) | 337 |
| A DISTANCIA | 133 |
| Other values (8) | 229 |
Length
| Max length | 37 |
|---|---|
| Median length | 15 |
| Mean length | 16.030989 |
| Min length | 5 |
Characters and Unicode
| Total characters | 138123 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DIARIO(REGULAR) |
|---|---|
| 2nd row | DIARIO(REGULAR) |
| 3rd row | DIARIO(REGULAR) |
| 4th row | DIARIO(REGULAR) |
| 5th row | DIARIO(REGULAR) |
Common Values
| Value | Count | Frequency (%) |
| DIARIO(REGULAR) | 5239 | |
| FIN DE SEMANA | 2254 | |
| SEMIPRESENCIAL (FIN DE SEMANA) | 424 | 4.9% |
| SEMIPRESENCIAL (UN DIA A LA SEMANA) | 337 | 3.9% |
| A DISTANCIA | 133 | 1.5% |
| SEMIPRESENCIAL | 75 | 0.9% |
| SEMIPRESENCIAL (DOS DIAS A LA SEMANA) | 52 | 0.6% |
| VIRTUAL A DISTANCIA | 41 | 0.5% |
| SABATINO | 40 | 0.5% |
| DOMINICAL | 15 | 0.2% |
| Other values (3) | 6 | 0.1% |
Length
| Value | Count | Frequency (%) |
| diario(regular | 5239 | |
| semana | 3067 | |
| fin | 2678 | |
| de | 2678 | |
| semipresencial | 888 | 5.4% |
| a | 563 | 3.4% |
| la | 389 | 2.3% |
| un | 337 | 2.0% |
| dia | 337 | 2.0% |
| distancia | 174 | 1.1% |
| Other values (8) | 206 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 19331 | |
| R | 16654 | |
| I | 15786 | |
| E | 13652 | |
| D | 8549 | 6.2% |
| 7940 | 5.7% | |
| N | 7201 | 5.2% |
| L | 6576 | 4.8% |
| ( | 6052 | 4.4% |
| ) | 6052 | 4.4% |
| Other values (12) | 30330 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 118079 | |
| Space Separator | 7940 | 5.7% |
| Open Punctuation | 6052 | 4.4% |
| Close Punctuation | 6052 | 4.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 19331 | |
| R | 16654 | |
| I | 15786 | |
| E | 13652 | |
| D | 8549 | |
| N | 7201 | 6.1% |
| L | 6576 | 5.6% |
| U | 5619 | 4.8% |
| O | 5350 | 4.5% |
| G | 5241 | 4.4% |
| Other values (9) | 14120 |
Space Separator
| Value | Count | Frequency (%) |
| 7940 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6052 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6052 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 118079 | |
| Common | 20044 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 19331 | |
| R | 16654 | |
| I | 15786 | |
| E | 13652 | |
| D | 8549 | |
| N | 7201 | 6.1% |
| L | 6576 | 5.6% |
| U | 5619 | 4.8% |
| O | 5350 | 4.5% |
| G | 5241 | 4.4% |
| Other values (9) | 14120 |
Common
| Value | Count | Frequency (%) |
| 7940 | ||
| ( | 6052 | |
| ) | 6052 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 138123 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 19331 | |
| R | 16654 | |
| I | 15786 | |
| E | 13652 | |
| D | 8549 | 6.2% |
| 7940 | 5.7% | |
| N | 7201 | 5.2% |
| L | 6576 | 4.8% |
| ( | 6052 | 4.4% |
| ) | 6052 | 4.4% |
| Other values (12) | 30330 |
DEPARTAMENTAL
Categorical
HIGH CORRELATION 
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.4 KiB |
| GUATEMALA NORTE | |
|---|---|
| GUATEMALA SUR | |
| GUATEMALA OCCIDENTE | |
| ESCUINTLA | |
| HUEHUETENANGO | |
| Other values (19) |
Length
| Max length | 19 |
|---|---|
| Median length | 14 |
| Mean length | 12.072772 |
| Min length | 5 |
Characters and Unicode
| Total characters | 104019 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ALTA VERAPAZ |
|---|---|
| 2nd row | ALTA VERAPAZ |
| 3rd row | ALTA VERAPAZ |
| 4th row | ALTA VERAPAZ |
| 5th row | ALTA VERAPAZ |
Common Values
| Value | Count | Frequency (%) |
| GUATEMALA NORTE | 1037 | 12.0% |
| GUATEMALA SUR | 796 | 9.2% |
| GUATEMALA OCCIDENTE | 774 | 9.0% |
| ESCUINTLA | 599 | 7.0% |
| HUEHUETENANGO | 495 | 5.7% |
| QUETZALTENANGO | 476 | 5.5% |
| PETEN | 379 | 4.4% |
| SUCHITEPEQUEZ | 377 | 4.4% |
| GUATEMALA ORIENTE | 363 | 4.2% |
| IZABAL | 360 | 4.2% |
| Other values (14) | 2960 |
Length
| Value | Count | Frequency (%) |
| guatemala | 2970 | |
| norte | 1084 | 8.6% |
| sur | 796 | 6.3% |
| occidente | 774 | 6.2% |
| escuintla | 599 | 4.8% |
| huehuetenango | 495 | 3.9% |
| quetzaltenango | 476 | 3.8% |
| verapaz | 468 | 3.7% |
| peten | 379 | 3.0% |
| suchitepequez | 377 | 3.0% |
| Other values (17) | 4137 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 17565 | |
| E | 14108 | |
| T | 9811 | 9.4% |
| U | 8423 | 8.1% |
| N | 6351 | 6.1% |
| L | 6167 | 5.9% |
| G | 4412 | 4.2% |
| O | 4297 | 4.1% |
| C | 4114 | 4.0% |
| 3939 | 3.8% | |
| Other values (12) | 24832 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 100080 | |
| Space Separator | 3939 | 3.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 17565 | |
| E | 14108 | |
| T | 9811 | |
| U | 8423 | |
| N | 6351 | 6.3% |
| L | 6167 | 6.2% |
| G | 4412 | 4.4% |
| O | 4297 | 4.3% |
| C | 4114 | 4.1% |
| M | 3823 | 3.8% |
| Other values (11) | 21009 |
Space Separator
| Value | Count | Frequency (%) |
| 3939 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 100080 | |
| Common | 3939 | 3.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 17565 | |
| E | 14108 | |
| T | 9811 | |
| U | 8423 | |
| N | 6351 | 6.3% |
| L | 6167 | 6.2% |
| G | 4412 | 4.4% |
| O | 4297 | 4.3% |
| C | 4114 | 4.1% |
| M | 3823 | 3.8% |
| Other values (11) | 21009 |
Common
| Value | Count | Frequency (%) |
| 3939 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104019 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 17565 | |
| E | 14108 | |
| T | 9811 | 9.4% |
| U | 8423 | 8.1% |
| N | 6351 | 6.1% |
| L | 6167 | 5.9% |
| G | 4412 | 4.2% |
| O | 4297 | 4.1% |
| C | 4114 | 4.0% |
| 3939 | 3.8% | |
| Other values (12) | 24832 |
CODIGO
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 8616 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 67.4 KiB |
ZONA
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 21 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 7080 |
| Missing (%) | 82.2% |
| Memory size | 67.4 KiB |
| ZONA 1 | |
|---|---|
| ZONA 7 | |
| ZONA 12 | |
| ZONA 18 | |
| ZONA 6 | |
| Other values (16) |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.3255208 |
| Min length | 6 |
Characters and Unicode
| Total characters | 9716 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ZONA 1 |
|---|---|
| 2nd row | ZONA 1 |
| 3rd row | ZONA 1 |
| 4th row | ZONA 1 |
| 5th row | ZONA 1 |
Common Values
| Value | Count | Frequency (%) |
| ZONA 1 | 628 | 7.3% |
| ZONA 7 | 173 | 2.0% |
| ZONA 12 | 114 | 1.3% |
| ZONA 18 | 102 | 1.2% |
| ZONA 6 | 71 | 0.8% |
| ZONA 11 | 62 | 0.7% |
| ZONA 2 | 54 | 0.6% |
| ZONA 19 | 53 | 0.6% |
| ZONA 13 | 46 | 0.5% |
| ZONA 3 | 40 | 0.5% |
| Other values (11) | 193 | 2.2% |
| (Missing) | 7080 |
Length
| Value | Count | Frequency (%) |
| zona | 1536 | |
| 1 | 628 | |
| 7 | 173 | 5.6% |
| 12 | 114 | 3.7% |
| 18 | 102 | 3.3% |
| 6 | 71 | 2.3% |
| 11 | 62 | 2.0% |
| 2 | 54 | 1.8% |
| 19 | 53 | 1.7% |
| 13 | 46 | 1.5% |
| Other values (12) | 233 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| Z | 1536 | |
| O | 1536 | |
| N | 1536 | |
| A | 1536 | |
| 1536 | ||
| 1 | 1188 | |
| 2 | 202 | 2.1% |
| 7 | 193 | 2.0% |
| 8 | 107 | 1.1% |
| 6 | 89 | 0.9% |
| Other values (5) | 257 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6144 | |
| Decimal Number | 2036 | 21.0% |
| Space Separator | 1536 | 15.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1188 | |
| 2 | 202 | 9.9% |
| 7 | 193 | 9.5% |
| 8 | 107 | 5.3% |
| 6 | 89 | 4.4% |
| 3 | 86 | 4.2% |
| 9 | 81 | 4.0% |
| 5 | 44 | 2.2% |
| 0 | 27 | 1.3% |
| 4 | 19 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 1536 | |
| O | 1536 | |
| N | 1536 | |
| A | 1536 |
Space Separator
| Value | Count | Frequency (%) |
| 1536 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6144 | |
| Common | 3572 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1536 | ||
| 1 | 1188 | |
| 2 | 202 | 5.7% |
| 7 | 193 | 5.4% |
| 8 | 107 | 3.0% |
| 6 | 89 | 2.5% |
| 3 | 86 | 2.4% |
| 9 | 81 | 2.3% |
| 5 | 44 | 1.2% |
| 0 | 27 | 0.8% |
Latin
| Value | Count | Frequency (%) |
| Z | 1536 | |
| O | 1536 | |
| N | 1536 | |
| A | 1536 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9716 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Z | 1536 | |
| O | 1536 | |
| N | 1536 | |
| A | 1536 | |
| 1536 | ||
| 1 | 1188 | |
| 2 | 202 | 2.1% |
| 7 | 193 | 2.0% |
| 8 | 107 | 1.1% |
| 6 | 89 | 0.9% |
| Other values (5) | 257 | 2.6% |
| Unnamed: 0 | DEPARTAMENTO | SECTOR | AREA | STATUS | MODALIDAD | JORNADA | PLAN | DEPARTAMENTAL | ZONA | |
|---|---|---|---|---|---|---|---|---|---|---|
| Unnamed: 0 | 1.000 | 0.790 | 0.117 | 0.173 | 0.105 | 0.152 | 0.117 | 0.082 | 0.843 | 0.925 |
| DEPARTAMENTO | 0.790 | 1.000 | 0.162 | 0.207 | 0.124 | 0.300 | 0.130 | 0.113 | 1.000 | 1.000 |
| SECTOR | 0.117 | 0.162 | 1.000 | 0.140 | 0.078 | 0.141 | 0.150 | 0.140 | 0.166 | 0.225 |
| AREA | 0.173 | 0.207 | 0.140 | 1.000 | 0.035 | 0.120 | 0.090 | 0.064 | 0.221 | 0.265 |
| STATUS | 0.105 | 0.124 | 0.078 | 0.035 | 1.000 | 0.027 | 0.161 | 0.134 | 0.131 | 0.126 |
| MODALIDAD | 0.152 | 0.300 | 0.141 | 0.120 | 0.027 | 1.000 | 0.095 | 0.082 | 0.304 | 0.000 |
| JORNADA | 0.117 | 0.130 | 0.150 | 0.090 | 0.161 | 0.095 | 1.000 | 0.561 | 0.137 | 0.093 |
| PLAN | 0.082 | 0.113 | 0.140 | 0.064 | 0.134 | 0.082 | 0.561 | 1.000 | 0.120 | 0.065 |
| DEPARTAMENTAL | 0.843 | 1.000 | 0.166 | 0.221 | 0.131 | 0.304 | 0.137 | 0.120 | 1.000 | 0.994 |
| ZONA | 0.925 | 1.000 | 0.225 | 0.265 | 0.126 | 0.000 | 0.093 | 0.065 | 0.994 | 1.000 |
| Unnamed: 0 | CODIGO | DISTRITO | DEPARTAMENTO | MUNICIPIO | ESTABLECIMIENTO | DIRECCION | TELEFONO | SUPERVISOR | DIRECTOR | NIVEL | SECTOR | AREA | STATUS | MODALIDAD | JORNADA | PLAN | DEPARTAMENTAL | CODIGO | ZONA | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 16-01-0138-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO COBAN | KM.2 SALIDA A SAN JUAN CHAMELCO ZONA 8 | 77945104 | MERCEDES JOSEFINA TORRES GALVEZ | JULIO CESAR VILLELA AMADO | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN |
| 1 | 1 | 16-01-0139-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO PARTICULAR MIXTO VERAPAZ | KM 209.5 ENTRADA A LA CIUDAD | 77367402 | MERCEDES JOSEFINA TORRES GALVEZ | NaN | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN |
| 2 | 2 | 16-01-0140-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO "LA INMACULADA" | 7A. AVENIDA 11-109 ZONA 6 | 78232301 | MERCEDES JOSEFINA TORRES GALVEZ | VIRGINA SOLANO SERRANO | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN |
| 3 | 3 | 16-01-0141-46 | 16-005 | ALTA VERAPAZ | COBAN | ESCUELA NACIONAL DE CIENCIAS COMERCIALES | 2A CALLE 11-10 ZONA 2 | 79514215 | RUDY ADOLFO TOT OCH | NaN | DIVERSIFICADO | OFICIAL | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN |
| 4 | 4 | 16-01-0142-46 | 16-005 | ALTA VERAPAZ | COBAN | INSTITUTO NORMAL MIXTO DEL NORTE 'EMILIO ROSALES PONCE' | 3A AVE 6-23 ZONA 11 | 79521468 | RUDY ADOLFO TOT OCH | NaN | DIVERSIFICADO | OFICIAL | URBANA | ABIERTA | BILINGUE | VESPERTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN |
| 5 | 5 | 16-01-0143-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO PARTICULAR MIXTO IMPERIAL | 5A. CALLE 1-9 ZONA 3 | 57101061 | MERCEDES JOSEFINA TORRES GALVEZ | HECOTR WALDEMAR TOT COY | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | DOBLE | FIN DE SEMANA | ALTA VERAPAZ | NaN | NaN |
| 6 | 6 | 16-01-0145-46 | 16-006 | ALTA VERAPAZ | COBAN | INSTITUTO DE TURSMO Y AVIACON DEL NORTE I.T.A.N | 3 AV. 5-28 ZONA 4 | 54641454 | EFRAIN CAAL CUC | LUIS FERNANDO SOTO | DIVERSIFICADO | PRIVADO | URBANA | CERRADA TEMPORALMENTE | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN |
| 7 | 7 | 16-01-0147-46 | 16-031 | ALTA VERAPAZ | COBAN | COLEGIO "LA INMACULADA" | 7A. CALLE 11-09 ZONA 6 COBAN | 49532425 | MERCEDES JOSEFINA TORRES GALVEZ | MERCEDES QUIROS QUIROS | DIVERSIFICADO | PRIVADO | RURAL | CERRADA TEMPORALMENTE | MONOLINGUE | DOBLE | DIARIO(REGULAR) | ALTA VERAPAZ | NaN | NaN |
| 8 | 8 | 16-01-0150-46 | 16-006 | ALTA VERAPAZ | COBAN | INSTITUTO INTERCULTRUAL ALTAVERAPACENCESE -IIAV- | 3A. AVAENIDA 1-23 ZONA 4 | NaN | EFRAIN CAAL CUC | GUILLERMO ESTUARDO VASQUEZ MORALES | DIVERSIFICADO | PRIVADO | URBANA | CERRADA TEMPORALMENTE | BILINGUE | DOBLE | FIN DE SEMANA | ALTA VERAPAZ | NaN | NaN |
| 9 | 9 | 16-01-0155-46 | 16-031 | ALTA VERAPAZ | COBAN | LICEO "MODERNO LATINO" | 11 AVENIDA 5-17 ZONA 4 | 79522555 | MERCEDES JOSEFINA TORRES GALVEZ | JORGE BENEDICTO COC POP | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | DOBLE | FIN DE SEMANA | ALTA VERAPAZ | NaN | NaN |
| Unnamed: 0 | CODIGO | DISTRITO | DEPARTAMENTO | MUNICIPIO | ESTABLECIMIENTO | DIRECCION | TELEFONO | SUPERVISOR | DIRECTOR | NIVEL | SECTOR | AREA | STATUS | MODALIDAD | JORNADA | PLAN | DEPARTAMENTAL | CODIGO | ZONA | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8606 | 9006 | 12-29-4728-46 | 12-053 | SAN MARCOS | SAN LORENZO | INSTITUTO DIVERSIFICADO 'DR. JUAN JOSE AREVALO BERMEJO' | SAN LORENZO | NaN | AMILCAR ROCAEL VELASQUEZ OROZCO | NaN | DIVERSIFICADO | PRIVADO | URBANA | CERRADA TEMPORALMENTE | MONOLINGUE | VESPERTINA | DIARIO(REGULAR) | SAN MARCOS | NaN | NaN |
| 8607 | 9007 | 12-29-5026-46 | 12-053 | SAN MARCOS | SAN LORENZO | INSTITUTO TECNOLOGICO PRIVADO MIXTO EVANGELICO SUNESIS | SAN LORENZO | 51786859 | AMILCAR ROCAEL VELASQUEZ OROZCO | EMIR OSBELI FUENTES VASQUEZ | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | MATUTINA | DIARIO(REGULAR) | SAN MARCOS | NaN | NaN |
| 8608 | 9008 | 12-30-0046-46 | 12-107 | SAN MARCOS | LA BLANCA | INSTITUTO NACIONAL DE EDUCACION DIVERSIFICADA | CABECERA MUNICIPAL | 41016246 | JUAN JOSE TOBAR TEBALAN | WALTER RENE PEREZ Y PEREZ | DIVERSIFICADO | OFICIAL | URBANA | ABIERTA | MONOLINGUE | VESPERTINA | DIARIO(REGULAR) | SAN MARCOS | NaN | NaN |
| 8609 | 9009 | 12-30-0051-46 | 12-107 | SAN MARCOS | LA BLANCA | COLEGIO ADVENTISTA MARANATHA | CABECERA MUNICIPAL | 49582374 | JUAN JOSE TOBAR TEBALAN | SONIA NOEMI GARCIA FUENTES DE AGUIRRE | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | VESPERTINA | DIARIO(REGULAR) | SAN MARCOS | NaN | NaN |
| 8610 | 9010 | 12-30-0052-46 | 12-107 | SAN MARCOS | LA BLANCA | COLEGIO ADVENTISTA MARANATHA | CABECERA MUNICIPAL | 49582374 | JUAN JOSE TOBAR TEBALAN | SONIA NOEMI GARCIA FUENTES DE AGUIRRE | DIVERSIFICADO | PRIVADO | URBANA | CERRADA TEMPORALMENTE | MONOLINGUE | DOBLE | DOMINICAL | SAN MARCOS | NaN | NaN |
| 8611 | 9011 | 12-30-0056-46 | 12-107 | SAN MARCOS | LA BLANCA | COLEGIO PRIVADO URBANO MIXTO LICEO MODERNO | CABECERA MUNICIPAL | 58899435 | JUAN JOSE TOBAR TEBALAN | MELVI WALDIR HURTADO CIFUENTES | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | VESPERTINA | DIARIO(REGULAR) | SAN MARCOS | NaN | NaN |
| 8612 | 9012 | 12-30-0057-46 | 12-107 | SAN MARCOS | LA BLANCA | COLEGIO PRIVADO URBANO MIXTO LICEO MODERNO | CABECERA MUNICIPAL | 58899435 | JUAN JOSE TOBAR TEBALAN | MELVI WALDIR HURTADO CIFUENTES | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | DOBLE | FIN DE SEMANA | SAN MARCOS | NaN | NaN |
| 8613 | 9020 | 12-30-0085-46 | 12-093 | SAN MARCOS | LA BLANCA | COLEGIO ADVENTISTA MARANATHA | CABECERA MUNICIPAL | 49582374 | JUAN JOSE TOBAR TEBALAN | USIELA SARONITA CARRETO LOPEZ | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | SIN JORNADA | SEMIPRESENCIAL (UN DIA A LA SEMANA) | SAN MARCOS | NaN | NaN |
| 8614 | 9021 | 12-30-0087-46 | 12-107 | SAN MARCOS | LA BLANCA | COLEGIO EDUCATIVO MIXTO LUNA AZUL -CEMLA- | PARCELAMIENTO CHIQUIRINES | 45454204 | JUAN JOSE TOBAR TEBALAN | BETY PATRICIA OXLAJ PEREZ | DIVERSIFICADO | PRIVADO | RURAL | CERRADA TEMPORALMENTE | MONOLINGUE | SIN JORNADA | SEMIPRESENCIAL (UN DIA A LA SEMANA) | SAN MARCOS | NaN | NaN |
| 8615 | 9022 | 12-30-0089-46 | 12-107 | SAN MARCOS | LA BLANCA | COLEGIO PRIVADO URBANO MIXTO LICEO MODERNO | CABECERA MUNICIPAL | 36121181 | JUAN JOSE TOBAR TEBALAN | MAVERIK GEYSTYNG HURTADO CIFUENTES | DIVERSIFICADO | PRIVADO | URBANA | ABIERTA | MONOLINGUE | SIN JORNADA | SEMIPRESENCIAL (UN DIA A LA SEMANA) | SAN MARCOS | NaN | NaN |